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Abstract. We survey old and new results about optimal algoritlims for summation 
of finite sequences and for integration of functions from Holder or Sobolev spaces. 
First we discuss optimal deterministic and randomized algorithms. Then we add 
a new aspect, which has not been covered before on conferences about (quasi-) 
Monte Carlo methods: quantum computation. We give a short introduction into this 
setting and present recent results of the authors on optimal quantum algorithms 
for summation and integration. We discuss comparisons between the three settings. 
The most interesting case for Monte Carlo and quantum integration is that of 
moderate smoothness k and large dimension d which, in fact, occurs in a number of 
important applied problems. In that case the deterministic exponent is negligible, 
so the n~^^^ Monte Carlo and the quantum speedup essentially constitute the 
entire convergence rate. We observe that 

• there is an exponential speed-up of quantum algorithms over deterministic 
(classical) algorithms, if k/d tends to zero; 

• there is a (roughly) quadratic speed-up of quantum algorithms over randomized 
classical algorithms, if k/d is small. 



1.1 Introduction 

We study two numerical problems: Summation of finite sequences and in- 
tegration of functions. Besides the use of deterministic algorithms both are 
prominent examples of the application of Monte Carlo methods and, quite 
recently, also of quantum computation. When we study these problems from 
the point of view of these three settings, we are usually interested in situ- 
ations at the border of feasibility, that is, for the summation problem in a 
huge number of summands iV, and for the integration problem, in a large 
dimension d. Clearly, the sum can be computed in TV — 1 additions exactly, 
but we want to understand if considerably fewer operations suffice to pro- 
vide an approximation of required precision. For integration it is clear that 
we usually cannot compute the integral exactly, hence one has to settle for 
approximations. For both problems the question arises of how good they can 
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be solved in the given settings, and how the results compare. Such an ap- 
proach incorporates two aspects: Providing good algorithms on one hand, 
and proving lower bounds - that is, bounds such that no algorithm can be 
better - on the other hand. Only when this is done in each of the settings, 
one can compare them on theoretically founded grounds. Basic tools for such 
an analysis are provided by the theory of information-based complexity, see 
Traub, Wasilkowski, Wozniakowski (1988), Novak (1988), Heinrich (1993). 

In this paper we want to give a survey of the state of the art in the above 
mentioned circle of questions. Of course, comparisons between the determin- 
istic and the randomized setting particularly well fit the topic of this confer- 
ence. On the other hand, wc have also included the new aspect of quantum 
computation. Although quantum computers exist so far only in laboratories 
and their full computational power is still hypothetic, many physicists and 
computer scientists believe in the future of quantum computing and already 
now started to investigate, in which problems a quantum computer can bring 
advantages over the classical one. It is intriguing that quantum algorithms, 
although using a completely different model of computation, also possess and 
exploit features of randomization. So it is natural to study their relation to 
Monte Carlo methods. In our investigations on quantum summation and in- 
tegration, wc found that certain Monte Carlo techniques proved to be quite 
useful for the quantum setting. We discuss this below. Since this setting is 
certainly new to many people in the MCQMC community, we chose to present 
it in a detailed way to give an insight not only into the special results but 
also into the general framework and model of computation. 

Let us now formulate the problems under consideration. We are given a 
nonempty set D, a set F of real- valued functions on D and an operator S 
from F to R. We seek to compute (approximately) S{f) for f & F, where / 
can only be accessed through its values. In this paper we shall consider the 
summation operator (or mean operator) S = Sn and the integration operator 
S = Id, which are defined as follows: For / G we define 

1 ^ 

and for an integrable / : [0, 1]"^ — > R we define 

Id{f)= I f{x)dx. 

Wc will study deterministic, randomized, and quantum algorithms for the 
computation of S^if) and Id{f), up to some error e. 

For the summation operator Sn we assume that / is from the set F = 
of all / e R^ such that 
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1 < p < 00. (Let us mention that Lp is the unit ball of the space Lp{ij), 
where /z is the equidistribution on {!,..., N}.) For p = oo we consider the 
set of all / € such that 

\f{i)\ < 1 for all i. 

For the integration operator I4 we assume that / : [0, 1]'' — > R is from a 
Holder class F = F^'" or from a Sobolev class F = W^ j^. The Holder classes 
are defined by 

{/ e C\ ll/lloo < 1, \D'f{x) - D'f{y)\ <\x- yr , x, y e [0, 1]^ |i| = k} , 

whore k G No, < a < 1, C*' stands for the set of functions / which are 
continuous together with all their partial derivatives up to order k, || 
(1 < P < 00) denotes the Lp-norm with respect to the Lebesgue measure on 
[0; l]**; \x — y\ is the Euclidean distance between x and y, and |i| means the 
sum of the components of the multiindex i. The Sobolev classes are defined 

by 

K<i = {f ■■ II^Vl|p<l, \i\<k}, 

where k G Nq, 1 < p < 00, and is here the weak partial derivative. 

For the integration problem in Sobolev spaces, we always assume the 
embedding condition 

k-p>d, (1.1) 

which guarantees, by the Sobolev embedding theorem, that the elements of 
Wp d E^re continuous functions, and hence function values are well-defined. 

Let us briefly describe the organization of the paper. In Section 2 we 
survey known results about optimal deterministic algorithms for Sn on Lp 
and Id on -F^'" and Wp ^. Section 3 is concerned with randomized (or Monte 
Carlo) algorithms for the same problems. In Section 4 we give an introduction 
into the model of quantum computation and survey recent results of Novak 
(2001) and Heinricli (2001a, b) on optimal algorithms for summation and 
integration on a quantum computer. 



1.2 Deterministic Algorithms 

We consider numerical algorithms of the form 

An{f) = V'{f{xi),...,f{Xn)), (1.2) 

where Xi G D{i = 1, . . . , n), and <p : R" — > R is an arbitrary mapping. (In 

the terminology of information-based complexity, this is the class of all non- 
adaptive, in general nonlinear algorithms using n function values.) A special 
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subclass is formed by linear algorithms, i.e. quadratures 

n 

(1-3) 



i=l 



with fli € R and Xi € D. 

The error of a method A„ of the form (1.2) is defined as 

e(A„,F) = sup|5(/)-A„(/)|. 

The central quantity for our analysis is the n-th minimal error defined for 
n e N asQ 

er(F)=infe(A„,F). 

An 

The classes L^, F^'", and VF^ ^ are unit balls in Banach spaces, so they are 
convex and symmetric. The operators Sn and Id are linear. It is known that 
under these assumptions linear methods ( |l.3| ) are optimal (even among all 
adaptive, nonlinear methods). This was proved in Bakhvalov (1971), see also 
Novak (1996) and Traub, Wasilkowski, Wozniakowski (1988). Therefore it is 
not difhcult to find an optimal method for the summation operator Sn on 

tN 



N 

i=l 



and its error 



1-1/p 



where n < N. Oi course we obtain e5^°'(L^) = for n > N and therefore 
always assume that n < N. The spaces are increasing with decreasing p 
and for the extreme cases p — oo and p = 1 we obtain 

e'nHLZ)^^ and er(Lf) = l, n<N. 

For later reference we summarize these (well known) results as follows. 
Theorem 1. Let 1 < p < oo and n < N . Then 



N 



^ Formally one can also allow the case n = with constant algorithms Aq. For all 
the problems which we study one obtains eo'^*(F) — 1, hence all the problems are 
scaled in the same way. 
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Now we discuss the integration problem. Again the results are classical, see 
Bakhvalov (1959) or references from information based complexity, such as 
Novak (1988), Traub, Wasilkowski, Wozniakowski (1988) or Heinrich (1993). 
We begin with the result for the Holder classes. 

Theorem 2. Let k eNo andO < a <l. Ther^ 

For the Sobolev classes we have the following result. 

Theorem 3. Let /c e N and 1 < p < oo, such that ( [j. j| j holds. Then 

1.3 Randomized Algorithms 

As numerical algorithms we now consider An ~ {AfJ^jen, where 

<(/) = ^"(/«), •■■,/«)), (1-4) 

{n, U,P) is a probability space, for each uj E f], xf is an element of D and 
ip'^ is a mapping from R" to R with the property that for each f E F, the 
mapping 

c.er2^A-(/) = ^-(/(.<),...,/(x-)) 

is a random variable with values in R. Again, randomized linear algorithms 
(randomized quadratures) are a special case, 

n 
i=l 

with random variables af with values in R and xf with values in D on 
{n, S, P). The error of a method (|l|) is 

e{A-,F) = sup(E(5(/) - A-(/))2) , (1.6) 
where E is the expectation. The randomized n-th minimal error is defined as 

^ We write a„ x h„ iff there are ci , C2 > and no G N such that ci &„ < a„ < C2 bn 
for all n > no. Similarly, we use a„ -< b„ if there are c > and no G N such that 
an < cb„ for all n > no . The respective constants c or Ci in this and the following 
statements may depend on the parameters k, a, d, and p, but do not depend on 
n and A'^. 
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Let us mention that in contrast to the deterministic setting no general re- 
sult about the optimality of linear methods among all methods is known for 
the randomized setting. We start with the summation operator Sn on Lp . 
Mathc (1995) found the optimal randomized summation formula for 2 < p < 
oo. It has the following form: choose randomly an n-subset {i^, . . . C 
{1, . . . N}, equidistributed on the family of all (^) n-subsets and put 

n 

where 



This c satisfies 



i{N-n) 
N -1 



1 

^ <c< -, 

/n n 



Note also that here the optimal method is linear. We summarize the known 
results on e™(L^) as follows (see Mathe (1995)). 



er(i™)- ==, 2<p<^. (1.7) 

I (jV-l)K 



Theorem 4. Let n< N. Then 

-N\ _ 

For I < p < 2 only the order of the error is known, 

eT{L'^)^n-^+^/\ (1.8) 

for N > f3n, where (3 > 1. 



In particular it follows from (1.7) that 



eT{L^)^n-'/\ 

for 2 < p < cxD and N > Pn, where /? > 1. 

We want to mention a couple of interesting facts around relation (1.8). 
One might ask if the classical Monte Carlo method (sampling independently, 
uniformly on {1, . . . , N}) yields this optimal rate for 1 < p < 2. It does not. 
The reason is that the variances of functions in , which we need for the 



error as defined in (1.6), will not be bounded uniformly in N. Even more is 



true: no linear methods (1.5) can reach this rate. Mathe (1992) proved that 



for 1 < p < 2 and N > (in the error of optimal linear methods is of the order 

min(7Vi/P-i/2^-i/2, 1) . 



However, it is easily checked that a slight (nonlinear) modification of the 
classical Monte Carlo method does give the optimal rate: replace the function 
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/ e by / defined by f{i) = f{i) if \f{i)\ < n^/P and f{i) = otherwise. 
Then apply standard Monte Carlo to /. It is also interesting that these results 
are sensitive with respect to the error criterion ( |1.6| ) . If we replace the L2 norm 

(E(5^(/)-A-(/))2)V2 

by the Lp norm 

(E(5^(/)-A-(/)f)Vf 

(or any Lg norm with 1 < q < p < 2), classical Mon te C arlo (a linear method) 
does provide the optimal rate - the same rate as ( |l.8| ). See Heinrich (1993), 
where this is shown for q = 1, but the proof gives also q — p (from which 
q < p follows trivially). Now we discuss the integration problem. For the 
Holder classes we have the following result from Bakhvalov (1959). 

Theorem 5. Let fc e Nq and < a < 1. Then 

The results for the Sobolcv classes are again due to Bakhvalov (1962), see 
also Novak (1988) and Heinrich (1993). 

Theorem 6. Let k G No, 1 < p < 00, and assume that the embedding 
condition holds. Then 

eT{W^,d) ^ n-^l"-^!^ for2<p<^, 

and 

forl<p<2. 

1.4 Quantum Algorithms 

By now it is well-known that quantum computers (if one succeeded in build- 
ing them) could yield considerable speed-ups for certain important discrete 
problems. Shor's (1994, 1998) quantum algorithm for factorization of integers 
is polynomial in the number of bits - while no such algorithm for a classical 
computer is known, and, moreover, nowadays secret codes firmly rely on the 
hope that no such algorithms exist. Grover (1996, 1998) presented a quantum 
search algorithm which finds a specified element out of N items in 0{^/N) op- 
erations, whereas classically 0{N) are necessary. These two discoveries largely 
encouraged quantum computer research and triggered a stream of further in- 
vestigations into the potential powers of quantum computers. Up to now, 
however, this research dealt almost exclusively with discrete problems. Here 
we want to know whether quantum computers are useful for problems like 
summation of reals or integrals. 
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First we introduce the model of computation. For basic notions, back- 
ground, and further material on quantum computing we refer to the sur- 
veys and monographs: Ekert, Hayden, Inamori (2000), Shor (2000), Pittenger 
(1999), Gruska (1999) and Nielsen, Chuang (2000). Let Hi be a 2-dimensional 
Hilbert space over C and let eo and ei be two orthonormal vectors in Hi. 
The space Hi represents a quantum bit, in the Dirac notation we have 

eo = |0) and ei = |1) . 

For m e N quantum bits we use the 2"*-dimensional tensor product space 

Hm=Hi® ...®Hi 

with m factors. An orthonormal basis is given by the 2™ vectors 

6f = . . . ® ej^ , 

where ij G {0, 1} and 

m 

l = Y^i^2"'-\ ^ = 0,...,2™-l. 

There are 2"* different hi and this corresponds to the 2"* different possibilities 
of an information that is given by m classical bits. The Dirac notation for bi 
is just instead of e,i ® ei^ one finds |ii,i2) or also \ii) \i2)- The formally 
different objects (ii, . . . , im) and ^ or be, are often identified and called classical 
state or basis state. Let U{Hm) denote the set of unitary operators on Hm- 
The decomposition of a; e H^ with respect to the basis {be) is given by 

x= X] Q'(»i,...,»,n)e»i O • • ■ = aebe. 

ijG{0,l} t=0 

We are only interested in normed vectors, ||a;|| = 1. All such vectors are 
called (pure) "quantum states" . For each quantum state there is a probability 
distribution on the classical states: the probability of I is ja^p. This is a 
typical feature of quantum algorithms - we cannot "read" (measure) the 
coordinates ae of a state x as above. When we measure x, the result is 
probabilistic: we obtain a classical state \() with probability \ae\'^ ■ 

Now let, as in the previous sections, D be a nonempty set, F a set of real- 
valued functions on D and let S be an operator from F to R. We describe 
in the sequel what we mean by a quantum algorithm on m qubits for the 
(approximate) computation of S. For this purpose we first need to introduce 
the notion of a quantum query. We follow Heinrich (2001a). Like the classical 
algorithms get information about the input / through function values, a 
quantum algorithm can access / through a quantum query 



g = (m',m",Z,T,/3), 
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where m' ,m" € N, m' + m" < m with m as above (the number of qubits), 
Z is a nonempty subset of {0, . . . , 2™ — 1}, and 

and 

/3:R^{0,...,2™"-1} 

are any mappings. The meaning of these components is the following: r is 

the rule by which we associate to a bit-string i G Z the node point r(i) G D 
at which we want to evaluate / (we include the case that r is not defined on 
all of {0, . . . , 2™ — 1}, which is convenient e.g. if the number of elements of 
D is finite, but not a power of 2). When we have f{T{i)), which is an element 
of R, we still need to convert it into a binary string - which is the role of the 
mapping (3. 

With a quantum query Q and an input f E F we associate a unitary 
operator Qf & U{Hm) which is defined on the basis state 

\i) \x) \y) e Hm= Hm' O Hm" O Hm —m'—m" 

as 

I / 1 / ly/ \^\i)\x)\y) otherwise. 

Here ® denotes addition modulo 2"*" . It is easily seen that Q/ is a bijection 
on the set of basis states, and hence can be extended uniquely to a unitary 
operator on H„i. 

A quantum algorithm with n quantum queries is a tuple 

An = {m,w,Q,{Ui)'!l^o,ip) , 

where m is the number of qubits (i.e. the algorithm acts on H,,-,,), w G 
is a basis state (the starting state of the algorithm), Q is a quantum query 
as defined above (supplying the information about /), Ui e U{Hm) are any 
fixed unitary operators (the quantum computations), and 

(p:{0,...,2'"-l}^R 

is an arbitrary mapping. The mapping ip produces the real number which is 
the output of the quantum computation. The algorithm An acts on input / 
as follows: It starts with the state w, to which J7o is applied, which gives Uqw. 
The mapping C/q unites all quantum operations before the first query call. 
Then Qf is applied, leading to QfUaw. Next Ui is applied (standing for the 
quantum operations between the first and the second query call), yielding 
UiQfUow. Then Qf is called again, etc. In the end the algorithm produces 
the state 

Z = UnQfUn-l . ■ . UiQfUoW . 
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Let 

z= "^./ 1^) • 

As we mentioned before, we cannot access the components aej of z di- 
rectly. The state z is measured, giving a random variable £,f{Lo) with values 
in {0, . . . , 2™ — 1}, which takes the value £ with probability Icc^./P- Finally 
the mapping ip is applied, which stands for the computations performed on 
the result of the measurement (on a classical computer). So the output of the 
algorithm is 

Note two important things: Firstly, the algorithm gets information about 
/ only through Q j, while the state w, the unitary operators Uj, and the map- 
ping ip are fixed from the beginning and do not depend on /. Secondly, each 
Ui stands, in fact, for a sequence of elementary quantum operations, called 
gates (like the basic logical operations in classical computation). The gates 
can be chosen in such a way that each unitary operator can be represented 
as a finite compositions of gates. For more details we refer to Ekert, Hayden, 
Inamori (2000). Here we are concerned with the query complexity, meaning 
that we want to study the minimal error which can be reached by algorithms 
using at most n queries (this is essentially parallel to the previous sections). 
So the number of gates needed to represent (implement) the Ui will not be 
discussed. 

What we described above is a quantum algorithm with a single measure- 
ment. One can put together several such algorithms to obtain an algorithm 
with multiple measurements: the output of the first part of the algorithm is 
used (by a classical computer) to compute the input of the second part, and 
so on. A formal description of this is given in Heinrich (2001a), where it is 
also shown that from the point of view of query complexity it makes (up to a 
factor of 2) no difference if algorithms with one or with several measurements 
are considered. 

The error of An at input / is defined probabilistically as 

e{An, f) = inf {s : P{|5(/) - A^if, ^)\ < > 3/4} , 

and 

e{An,F) = sup e{An,f). 

Note that we require an error probability not greater than 1/4. By repeating 
the algorithm k times and computing the median of the results, the error 
probability can be reduced to 2~'^'' for some c > not depending on k. 
Finally we define 
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Now we study the numbers e^(i^), e^(F^''") and e^(VFp^,rf). A ffam we 
start with the summation operator Sn on . The following result for 1 < 
p < oo is from Heinrich (2001a), in the case p = oo the upper bound is due 
to Brassard, H0yer, Mosca, Tapp (2000), the lower bound due to Nayak, Wu 
(1998). 

Theorem 7. Let n < N . Then 

el{L^)^n-^ /or2<p<oo, 
n-^ ^ el{L^) -< n-^\oi''^n loglogn, 

and 

el{Lf)^n-^+y^' forl<p<2, 

where the lower bound for e'^ {L^ ) in the last relation holds under the restric- 
tion < N . 

Next we discuss the integration problem. For the Holder classes we have the 
following result from Novak (2001). 

Theorem 8. Let /c e Nq and < a < 1. Then 

For the Sobolev classes Heinrich (2001b) proved the following result. 

Theorem 9. Let k G Nq and 1 < p < oo. // the embedding condition j| j 
holds, then 

el{W^,a)^n-^"-' 2 < p < oo , 
^-k/d-i ^ e^(W^2*^_J -< n'^'l'-^ log=^/2^1oglogn, 

and 

< n-'=/''"3/2+i/p iog3/Siloglogn 

«/ 1 < P < 2. 

We do not know the optimal order in the case 1 < p < 2, for p = 2 there is a 
logarithmic gap between the upper and lower bounds. Another gap is in the 
summation problem with 1 < p < 2 and relatively small N . Summation in 
the case p = 1 is open, as well. These questions are the subject of ongoing 
research. 

Let us finally present the results in a table and discuss some comparisons. 
In the following we omit log-factors and exhibit the asymptotic order of the 
error for < n < N jl. 

It can be read from the table that in the case 2 < p < oo Monte Carlo 
gives a speedup of n^^l"^ over deterministic algorithms both in summation 
and integration. In these cases quantum algorithms give a speedup of 
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(over deterministic algorithms), that is, the Monte Carlo gain is squared. 
A similar pattern can be found in the case 1 < p < 2 for summation of 
sequences. The speedup of Monte Carlo is here n~^~^^^P (hence the advantage 
over deterministic algorithms is reduced as p decreases). Nevertheless, in the 
quantum setting this speedup is squared again. In the case 1 < p < 2 for 
integration in Sobolev spaces, the quantum gain is still better: Even in the 
case p = 1, where Monte Carlo gives no advantage at all over deterministic 
algorithms, the quantum speedup is n^^/^. 





deterministic 


randomized 


quantum 


L^,2<p<oo 

T-ife.Q 

Wl^, 2<p<^ 
Kd^ 1 < P < 2 


1 
1 

^~{k+a)/d 

^-k/d 

^-k/d 


„-i+i/p 

^-(fe+a)/d-l/2 
n-k/d-1/2 
j^-k/d-l + l/p 


^-{k+a)/d-l 



The most interesting case for Monte Carlo and quantum integration is 
that of moderate smoothness k and large dimension^ d which, in fact, occurs 
in a number of important applied problems. In that case the deterministic 
exponent, {k + a)/d or k/d, is negligible, so the n^^^^ Monte Carlo and 
the quantum speedup essentially constitute the entire convergence rate. 
Hence we observe a situation similar to that of Grover's search algorithm: a 
quadratic speedup for quantum computation as compared to classical ran- 
domized algorithms. If we compare quantum algorithms with deterministic 
classical algorithms, then the speed-up is even much larger - it is exponential 
in the dimension d. 
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